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Summary 


The  basic  objective  of  the  CCMT  Officers  project  is  to  develop  an  econometric  model 
that  captures  the  promotion  trajectory  for  the  individual  officers,  conditional  on  past 
behavior  and  performance  as  well  as  on  past  (and  current)  economic  and  policy 
conditions.  The  model  is  also  conditional  on  staying  in  the  Navy  the  minimal  time  for 
promotion.  The  estimated  results  are  also  used  as  a  forecasting  tool  for  the  policy 
maker.  The  econometric  framework  used  here  is  a  two-step  first  order  Markov  model 
that  accommodates  time  dependent  information,  cohort  information,  censoring 
problems  with  the  data  as  well  as  incorporating  macro  economic  and  policy  level 
information.  In  the  first  step,  the  conditional  probabilities  of  staying  or  leaving  the  Navy 
are  estimated  conditional  on  staying  in  the  Navy  enough  time  to  be  eligible  for 
promotion.  In  the  second  step  of  the  estimation,  the  promotion  probability  within  the 
next  year  (conditional  on  all  the  independent  variables)  is  estimated.  The  estimation 
model  is  an  Information-Theoretic,  Generalized  Maximum  Entropy  model  that  is 
nonparametric  in  the  distribution. 


vn 


Table  of  Contents 


I  ntroduction  and  Objectives . 1 

The  Markov  Model . 1 

The  Basic  Promotion  Model . l 

Introducing  the  Individual  Level  Information  (Covariates) . 2 

Introducing  the  Global/Macro-Level  Variables . 3 

The  Estimation  Method . 4 

The  Information-Theoretic,  Generalized  Maximum  Entropy  Basic  Model . 4 

The  concentrated  (dual)  GME  method  is . 5 

Introducing  Prior  Information . 6 

Forecasting  and  Other  Statistics . 7 

Empirical  Results . 8 

Data  and  Analysis . 8 

The  Basic  Analysis  -  The  Aviation  Skill  Group  Example . 11 

Concluding  Remarks . 18 

References . 19 

Appendix  A:  Variables  and  Models . A-0 

List  of  Figures 

1.  Officer  Skill  Groups . 8 

2.  Final  list  of  variables  (with  a  brief  description) 

used  in  CCMT  Officers  Project . 10 

3.  Aviations  -  Pre  September  2001  Step-One:  Stay  -  Loss . 11 

4.  Step-Two:  Promotion  Probabilities  (Within  a  Year) . 12 

5.  Aviation  -  Mean  Forecasting  Promotion  Probabilities 

for  T=2,  4,  7  years  forward . 13 

6.  Aviation  -  Post  September  2001  Step-One:  Stay  Loss . 14 

7.  Step-Two:  Promotion  Probabilities  (Within  a  Year) . 15 

8.  Aviation  -  Mean  Forecasting  Promotion  Probabilities  T=2,  4,  7  years  forward . 16 

List  of  Tables 

1.  Career  Paths,  Skill  Group:  AVI . 17 

2.  Career  Paths,  Skill  Group:  SWO . 17 


IX 


I  ntroduction  and  Objectives 


The  basic  objective  of  the  Officers  promotion  modeling  project  was  to  develop  a 
promotion  probability  model  for  individual  officers  as  well  as  to  provide  a  forecasting 
tool  for  the  policy  maker.  In  a  previous  project,  a  theoretical  transition  probability 
model  was  developed.  It  is  a  first  order  Markov  model  that  accommodates  time 
dependent  information,  cohort  information,  censoring  problems  with  the  data  as  well  as 
incorporating  macro  economic  and  policy  level  information.1  The  estimation  model 
specified  for  this  project  was  an  Information-Theoretic,  Generalized  Maximum  Entropy 
model  that  is  a  semi-parametric  model.  The  Markov  model  developed  here  is  a  two-step 
model.  In  the  first  step,  the  probability  of  staying  or  leaving  the  Navy  for  a  period  of 
time  (based  on  Navy  rules  specified  on  the  years  since  commission)  is  calculated. 
Conditional  on  that,  in  the  second  step  the  promotion  probability  is  estimated. 

The  promotion  structure  of  officers  is  very  different  than  that  of  the  enlisted 
personnel.  Therefore,  an  important  aspect  of  this  project  was  to  test  whether  an 
econometric  model  can  explain  much  of  the  promotions  of  officers.  We  find  here,  that  it 
is  possible.  There  are  three  sets  of  complete  estimates  for  each  skill  group  analyzed.  The 
first  is  based  on  the  complete  data  set.  The  second  is  based  on  all  data  prior  to 
September  11,  2001  (peace  period).  The  third  is  based  on  all  data  after  September  11, 
2001  (war  period). 

In  the  Section  2  the  basic  Markov  model  is  defined  and  described.  The  estimator  is 
described  in  Section  3.  Some  of  the  basic  statistics  are  discussed  as  well  in  that  Section. 
The  data  and  basic  analysis  are  described  in  Section  4.  To  keep  this  report  a  reasonable 
size,  the  basic  results  for  the  Aviation  group  are  summarized  here  as  an  example.  These 
results  demonstrate  the  capabilities  of  the  estimations  done  and  the  resulting  forecasts. 
Appendix  A  provides  a  summary  of  the  data  used  in  the  final  estimates  and  some  of  the 
models  investigated. 


The  Basic  Markov  Model 


The  Basic  Promotion  Model 

The  best  model  to  capture  the  officers’  promotion  is  the  two-step  first  order  Markov 
transition  probabilities  model.  Conditional  on  a  first  step  that  estimates  the  conditional 
probability  of  staying  in  the  Navy,  the  transition/promotion  process  is  as  follows. 

Let  yitj  be  state./  (grade  level  j=03, ...,  06)  of  individual  i  in  period  t.  Specifically,  for 
each  individual  i  (i-i,  2,  ...,ri)  let  yitj  =  1  if  state./  (j=i,  2, K)  is  observed  at  period  t 
( t=i ,  2, T ),  and  yit]  =  0  for  all  other  K-i  states.  Next,  define  the  K  x  K  matrix  of 


1  Unpublished  technical  report;  ‘Career  Case  Manager  Technologies  for  Enlisted  Personnel’ 
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transition  (promotion)  probabilities  P  =  ( pkj )  representing  the  probability  of  promotion 

from  state  (grade)  k  to  state  (grade)  k+i=j.  One  of  the  K  states  represents  leaving  the 
navy  (it  can  be  refined  to  voluntary  and  non- voluntary  exits).  Since  an  individual  can  be 
promoted  just  one  grade  at  a  time,  and  assuming  no  individual  is  being  demoted  to  a 
lower  grade  (note  that  in  the  empirical  model  we  allow  for  demotion),  in  this  model 
pkj  =  0  for  all  j<k,  and  for  al\j>k+i. 

The  basic  relationship  between  period  ( t-i )  and  period  t  is  captured  via  the  KxK 
matrix  of  transition  (promotion)  probabilities 

K 

yw=  Z  Pk,yu-ik+  Z  A7>Vu  =Z^^-u  C1) 

p!,t-l.i=0}  {y:,i-i,t*0}  k=x 

where  the  middle  two  terms  represent  the  two  subsets  of  the  data:  the  case  where 
previous  (K-i)  states  are  zero  and  the  observed  k  state. 

Taking  into  account  the  noise  in  the  observed  data,  the  correct  noisy  observed  model  is 

K 

yitj  =  Z  ft,JVu+  Z  Pk,yi,<-u+£itj  =Z^hv-u+%  (2) 

k«-u=°}  {jv-u*0}  k=x 

where  e  represents  the  noise  in  the  data.  In  this  model,  each  e  is  naturally  bounded  in 

[-1,  l],  with  expected  mean  value  of  zero.  We  allow  correlations  across  time  in  that 
model.  The  exact  covariance  structure  and  the  noise  structure  will  be  finalized  and 
discussed  in  the  empirical  part. 

I  ntroducing  the  I  ndividual  Level  I  nformation  (Covariates) 

There  are  two  types  of  individual  level  covariates  in  this  model:  Time  Dependent  and 
Time  Independent. 

1.  Let  the  time-dependent  variables  (covariates)  be  x°t/  for  1=1,2, L.  These 

variables  capture  the  information  that  may  change  over  time,  such  as  the 
individual’s  evaluations,  the  individual’s  standing  within  her/his  cohort,  previous 
jobs,  previous  behavioral  problems,  changes  in  personal  characteristics  (married, 
children,  additional  education,  etc.). 

2.  Let  the  time-independent  variables  (covariates)  be  x)h  for  h=i,2, H.  These 
variables  capture  the  information  that  does  not  change  over  time,  including  the 
individual’s  basic  characteristics  (AFQT,  race,  gender,  basic  education2  level, 
etc.). 
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Note  that  education  may  change  over  time,  and  in  that  case  it  is  added  as  a  time -dependent  variable. 
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Next,  we  define  X  =  [x0X‘  ]  as  the  matrix  of  ah  of  the  individual-level  variables 
(demographics,  socio-economic  and  past  “behavior”  and  performance  in  the  Navy).  This 
matrix  is  of  dimension  ( NxT)x(L  +  H)  =  (NxT)xS  for  S=L+H  and  s=i,2,..,L,  L+i,...,  S. 

There  is  a  total  of  S  individual  level  variables,  where  the  last  H  variables  do  not  vary  over 
time  for  each  individual  i-i,  2, N. 

To  capture  the  relationship  between  the  observed  data,  y,  the  unknown  probabilities 
P,  and  the  covariates  X,  we  introduce  the  following  (cross  moments)  relationship: 


T  N 


T  N  K 


T  N  K 


T  N 


t=2  i=l 


t=2  i=l 


XXv*=XX  X  Pkjxu- Is  +  XX  X  Pkjyu-uxu-u  +  XXwiM- 

t-2  i- 1  {y/^i^-O}  t-2  i- 1 

T- 1  N  K  T-\  N  K  T- 1  N 

=  XX  X  ft,xiB+XX  X  p^tt^+XXv*. 

t=\  i= 1  {yitk  =0}  t= 1  i=l  {yitk* 0} 

r-i  /v  a:  r-i  n 


XXX  Pay*  xus  +  X  X  ^ 


itjXits 


t= 1  i=l  i=l 


«=1  (=1 


A  discussion  of  all  variables  is  provided  below. 

I  ntroducing  the  Global/  Macro- Level  Variables 


The  next  step  is  to  extend  this  framework  to  include  economic  and  political  variables. 
These  variables  may  have  direct  or  indirect  effects  on  the  individuals’  promotion 
probabilities  thorough  their  global  effect  on  the  navy,  the  individual’s  specific  skill  (or 
profession),  or  the  environment  as  a  whole. 


Specifically,  let  z?be  a  G-dimensional  vector  of  global  variables  for  each  period  t, 

composed  of  G  macro  and  policy  variables  (or  instruments).  As  we  do  not  know  the 
direct  impact  and  relationship  between  these  set  of  variables  and  our  data  y,  the 
covariates  X,  or  the  promotion  probabilities  P,  we  introduce  this  information  via  the 
cross-moments  of  the  global  and  the  individual-level  variables.  There  are  two  ways  to  do 
so.  The  first  one  is  the  more  general  case  which  allows  interaction  between  these  global 
quantities  and  each  one  of  the  individual’s  characteristics.  The  second  case  is  done  by 
incorporating  these  global  variables  into  the  X  matrix.  After  testing  the  data,  in  the 
empirical  estimation  we  use  the  second  case  which  is  described  below. 

A  =  T  X°X1zl 

Define  L  J  as  the  matrix  of  all  of  the  variables  (individual  level  variables, 
the  X’s,  and  global  variables,  the  Z’s).  This  matrix  is  of  dimension 


(N  x  T)  x  (L  +  H  +  G) .  (N  x  T)  x  S*  forS,  =L+H+G  and  L+H_  __ 


L+H+i,...,  L+H+G.  There  are  a  total  of  ss  variables.  Rewriting  Eq.  (3)  yields 


(3) 
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T  N 


ss.v 


W  ^ its 8 


T- 1  N  K 


T- 1  N 


=  XXXp^v  +XX 


(3A) 


The  global-macro  level  variables  enter  as  lag  variables  (with  one  to  three  periods  lag, 
and  as  differenced  lag  variables)  where,  again  the  empirical  results  will  determine  the 
exact  variables  used. 


The  Estimation  Method 


The  I  nformation-Theoretic,  Generalized  Maximum  Entropy  Basic  Model 

To  simplify  the  presentations  and  notations,  we  formulate  the  basic  model  with 
covariates  (or  the  second  case  presented  in  3A  above).  Following  the  Information 
Theoretic  (IT)  -  Generalized  Maximum  Entropy  (GME)  approach  we  first,  reformulate 
the  noise  components  eitj  to  be  proper  probabilities  defined  on  some  support  space  v. 

For  background  on  the  GME  and  related  work  see  Golan,  Judge  and  Miller  (1996)  and 
Golan,  Judge  and  Perloff  (1996).  For  background  and  recent  work  on  IT  and  its 
relationship  to  GME  and  other  methods  of  estimation  and  inference  see  special  issues  of 
the  Journal  of  Econometrics  (2002,  2007).  Rewriting  Eq.  (3A)  yields 

T  N  T-\  N  K  T- 1  N 

X  X  y^us = X  X  X  Pkjyukhts + X  X  xits£«j 

t=2  i=l  t= 1  i=l  k= 1  1=1  i=l 

T- 1  N  K  T-\  N  M 

=zzz  PkjyitkXits  +  X  X  X  XitsWUjmVm 


M  M 

with  X  wujm  =  1  and  where  sitj  =  X%,vm  for  M  >  2 .  Since  sitj  e  [-1,1]  for  all  i,  t,j,  then 

m—l  m= 1 

vm  €  [-  l,l]  and  v  is  a  symmetric  around  zero  support  space  for  each  random  error 

defined  above.  In  the  empirical  analysis,  examples  and  discussions  of  the  size  of  M  as 
well  as  using  continuous  support  spaces  will  be  provided. 

By  now,  we  have  reformulated  the  basic  Markov  model  to  include  all  of  the  available 
information  (personal  and  global).  We  also  have  converted  the  unknown  errors  to  be 
fully  represented  by  a  set  of  proper  probabilities  (W),  so  all  of  the  unknown  quantities 
here  (P  and  W)  are  proper  probability  distributions.  We  can  now  construct  the  GME 
estimation  method  which  maximizes  the  joint  Shannon  (1948)  entropies  of  the  signal,  P, 
and  the  noise,  W,  subject  to  the  available  information  (the  data)  and  the  requirement 
that  both  P  and  W  are  proper  probability  distributions. 

The  basic  GME  Markov  model  is 


4 


Max  l  pkj  log  pkj  -  X  w<tN  log  witjm  \ 
k,j  itjm  J 

{pM 

s.t. 

(5) 


ZZ^v 


f=2  / 


r-i  a:  r-i 

ZZZ  Pkj  Visits  +  zzz  ^ itjm  ^ its  bn 


t= 1  i  k=\ 


t= 1  i  m 


Zft/  =  !;Zw«n  =1 

j  m 


The  GME  solution  is 


A  r-i 


exp 


Pkj  = 


ZZw, 

4  f=i  1,5 


f  T-l  ^ 

Zexp  -ZZw> 

y  v  *=1  »>  y 


^  r-i  A 

exp  -ZZ 

y  f=l  1,5 

Q,, 


(6a) 


and 


exp 


-Z**v«4 


w. .  = 

vvitjm 


V  5 


exp 


-Zx^h„4 


V  s 


Zexp  -Xxilsvmijs 


V  5 


XF, 


(6b) 


0 

where  js  are  the  J  x  ^  estimated  Lagrange  multipliers  associate  with  the  data  (Eq.  4), 
and  the  estimated  noise  components  are  e  -Z 

m 

The  concentrated  (dual)  GME  method  is 

Instead  of  using  the  constrained  optimization  estimation  model  (5),  the  GME  can  be 
formulated  as  an  unconstrained,  concentrated  (or  a  generalized  likelihood)  model: 
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(7) 


44  =  +  Xk)§ 

t= 2  j= 1  i',5  k 


Zcxp  -EEft*ftA 


\  t=\  i,s 


+ 


Elo§ 


ht,J 


(  \ 

ZCXP  ~XX^V-A; 


V  s 


=  ZZZ-V"/X"'4 + X10^  (  )toE  ) 

1=2  7=1  i,s  k  i,t,j 

where  Ajs  are  the  J  x  S  Lagrange  multipliers  which  are  the  real  set  of  unknown  and 
unobserved  quantities  in  this  model.  Minimizing  (7)  and  solving  for  A,  yields  the 

/V 

estimated  A  ,  which  in  turn  yield  the  optimal  probabilities  pkj  and  wiljm  via  relationship 
(6)3. 

I  introducing  Prior  I  nformation 

Within  the  approach  taken  here,  it  is  easy  to  introduce  prior  information 
representing  some  prior  knowledge  or  belief  of  the  Markov  probabilities  P,  call  it  P° . 

This  is  done  by  substituting  the  entropy  objective  with  the  cross-entropy  or  Kulback- 
Liebler  information-divergence  measure.  Rewriting  the  objective  of  our  GME  estimator 
as 

i(p,w-,p°,w°)=  (9) 

k,j  itjm 


and  minimizing  I(P,W)  with  respect  to  (3A  or  4)  and  the  requirements  of  proper 
probabilities  (Eft;  =  ';EMV»  =  ' )  yields  the  optimal  solutions  for  the  Generalized 

j  m 


Cross  Entropy  (GCE)  model 

(  T- 1 


ft  exp 


E  E  y itkXitsZtg^jsg 


Pki  = 


V  t=i  i’S’g 


I 

Eft  ex p  XXftAft4r 


V  ,=1  i’s’g 


( r-i  A 

plj  exp  ZZ  yukxitsztg * 

V  ,=1  ‘’s’g 

G, 


Pg 


(10  A) 


and 


3  To  keep  this  report  from  expanding  too  much,  the  relevant  statistics  as  well  as  the  variance-covariance  matrix  will 
be  presented  with  the  empirical  findings.  However,  these  quantities  are  available  upon  request. 
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(ioA) 


f 


Witj,n  eXP 


^its  Ztg  Vm  4 


W  = 

itjm 


V 


A 


Wi,jm  eXP 


j 


TjXitsZtgVm^ 

V  s.s 


Z  'N 

Zw°y»-exP  Hxitsztgvmhg 
m  V  s’8  J 


TV. 

“J 


where  now  we  introduced  the  macro  level  variables  Z  in  the  more  general  framework 
(“first  case”  discuss  in  Section  2.3).  Finally,  e  =  V  witjmvm  .  Note  that  the  priors  for  the 


noise  terms  are  always  taken  to  be  uniform  and  are  incorporated  here  just  for 
generalization. 

The  dual,  concentrated  GCE  is  just 


T  K 


*M=ZZZ  -  Z  log 

t= 2  7=1  i 

-Zlog 


i,t,J 
T  K 


S,g 

k 

j 

- 

f 

y 

Z  WUjm  eXP 

Z 

2. 
m  sjg 

m 

V  S’S 

J. 

Z  exp  Z  Z  ymvAj 


V  ?=1 


=ZZZwA-Zlo^(  )toE  ) 

t= 2  7=1  /,5,g  ^  i,t,j 


(n) 


where  both  normalization  factors QA(z)  and  T;;(/l)  are  defined  in  (10A-10B  )  above. 


Forecasting  and  Other  Statistics 


Based  on  the  estimated  parameters,  we  can  (1)  forecast  the  mean  behavior  of  the  skill 
group  analyzed  many  periods  into  the  future;  (2)  forecast  each  officer’s  potential 
promotion  trajectory,  for  many  periods  into  the  future,  based  on  her/his  personal 
characteristics,  past  performance  and  other  macro/policy  variables;  (3)  forecast  the 
individuals  (or  group  level)  promotion  trajectories  under  certain  constraints,  such  as 
geographic  restrictions;  and  (4)  simulate  different  policy  and  other  global  scenarios, 
such  as  the  impact  of  expanding/shrinking  a  certain  skill  group,  or  the  impact  of  a 
booming  economy  on  the  promotion  trajectories.  Technically,  these  forecasts  are  a 
simple  multiplication  of  the  promotion  matrix  while  accommodating  for  the  time 
varying  variables.  For  example,  if  P  is  the  5  by  5  matrix  of  promotion  probability  (with 
the  states:  O3,  O4,  O5,  06  and  Loss),  then  PxP  is  a  5  by  5  promotion  probabilities  two 
periods  into  the  future. 

The  impact  of  each  one  of  the  right  hand  side  variables  on  the  promotion 
probabilities  of  each  individual  sailor  are  captured  via  the  Marginal  Effects.  Technically, 
the  Marginal  Effects  of  individual  i,  at  period  t,  are: 

^  -/^(a-Z./aa,)- 
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Empirical  Results 


Data  and  Analysis 

Five  officers’  skill  groups  are  analyzed  here:  Aviation  (AVI),  Surface  Warfare  Group 
(SWO),  Special  Operations  (SPOPS),  Special  Warfare  (SPWAR),  and  Submarines  (SUB). 
The  dates  for  the  data  used  for  each  skill  group  are: 


Table.l.  Officer  Skill  Groups 


Skill  Group 

Start 

End 

Aviation 

Jan  96 

Aug  06 

SWO 

Jan  96 

Sep  06 

Submariner 

Jan  96 

Mar  07 

SP OPS 

Jan  96 

Apr  07 

SP WAR 

Jan  96 

Apr  07 

For  each  skill  group  we  provide  estimates  based  on  the  full  data  set  (with  a  dummy 
variable  for  September,  2001),  and  on  the  two  mutually  exclusive  subsets  of  Pre  and 
Post  September  2001.  This  is  because  our  basic  hypothesis  here  is  that  the  basic  set  of 
preferences  and  policies  have  changed  after  September,  2001.  Therefore,  it  is  more 
accurate  to  look  at  that  data  separately.  Because  the  method  of  estimation  used  here 
allows  us  to  analyze  relatively  smaller  samples,  we  are  able  to  do  it. 

The  basic  variables  used  in  all  the  models  are  summarized  in  the  following  table 
(Table  4.1).  We  note  that  many  more  variables  were  studied  and  analyzed,  but  we  are 
just  discussing  here  those  variables  that  are  included  in  the  final  set  of  models.  The 
decision  of  what  variables  should  be  in  the  final  model,  is  based  on  econometric  tests  as 
well  as  on  basic  information  provided  by  the  Navy. 

As  was  discussed  above,  this  is  a  two-step  model. .  We  need  to  prepare  the  data 
according  to  the  basic  Navy  rules.  These  rules  are  defined  in  terms  of  the  Years 
Commissioned  Service  (YCS)  and  .  promotion  opportunity  and  selection  rate  and  refers 
to  the  percentage  of  all  officers  selected  for  promotion.  Based  on  Navy  and  DoD 
documentation,  we  used  the  following  policy  assumptions: 

1.  O4  -  timing  10  years  +/- 1  year,  opportunity  80%  +/- 10% 

2.  O5  -  timing  16  years  +/- 1  year,  opportunity  70%  +/- 10% 

3.  06  -  timing  22  years  +/- 1  yr,  opportunity  50%  +/- 10% 

4.  For  O3  and  O2  all  qualified 

Based  on  this,  the  rules  we  used  in  the  analysis  are: 

O3  to  O4:  Minimum  of  9  years  YCS 

O4  to  O5:  Minimum  of  15  years  YCS 
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O5  to  06:  Minimum  of  21  years  YCS 

Finally,  given  that  there  is  no  stated  YCS  for  promotion  to  O7,  we  use  an  additional 
six  year  window  from  previous  rank  to  define  it  as: 

06  to  O7:  Minimum  of  27  years  YCS. 

In  terms  of  time  in  rank,  this  translates  to  effectively  six  years  in  rank  (minimum) 
before  an  officer  can  be  promoted  to  O4,  O5,  06  and  O7. 

The  office  data  is  an  unbalanced  panel  data  set,  therefore,  we  controlled  for  left  and 
right  censoring.  Appendix  A  describes  all  of  the  variables  used  in  the  final  model  and 
lists  the  major  models  investigated  prior  to  finding  the  final  model.  We  estimated  the 
promotion  probabilities  for  each  one  of  the  five  officer  skill  groups.  For  purposes  of 
illustration  only  the  results  for  the  Aviation  skill  group  are  discussed.  Three  sets  of 
results  are  reported  for  each  for  the  Aviation  group:  the  full  sample,  Pre  September, 
2001  and  Post  September,  2001. 
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Table.2. 

Final  list  of  variables  (with  a  brief  description)  used  in  CCMT  Officers  Project 


Categorical/  Discrete  Variables 

GENDER 

Male  =  1 

RACE_** 

Race  (empirical  flag) 

PAQD** 

Primary  AQD  (empirical  flag) 

SSFUNC.** 

Sub-specialty  -  grouped  by  FUNCTION  (empirical  flags) 

FLTCONC** 

Fleet  concentration  (empirical  flags) 

SS SEA  /  SS 0TH 

Sea,  shore,  or  other  duty. 

EDUC MAP 

Education  level  (MA  plus),  rest  are  reference  category 

Designation  Category  Code  (empirical  flags) 

APC MISS 

If  APC  score  is  missing 

JC* 

Joint  Specialty  Codes  (empirical  flags) 

SCREENING.** 

Screening  variables  (NO=none;  N1=C0_SCRN; 

N2=DH SCRN;  N3=MAJCMD SCRN;  and  N4=X0 SCRN) 

Continuous  Variables 

TIR,  TIR2 

Time  in  rank,  Time  in  rank  square 

YCS,  YCS2 

Years  since  commission,  and  its  square 

APC.INDEX 

APC  Measures  (additive  index  based  on  academic  profile 
code) 

NUMAQD 

Total  number  of  AQDs  (additional  qualification  designation) 
to  date 

AVIA_MOS_OPER_FLY 

Only  for  aviation  (Months  of  Operational  Flying) 

PROFEXP 

Performance  evaluation  variables 

EQUALOPP 

MILBEAR 


TEAMWORK 

LEADER 


TACTPERF 
RS  SUMAVG 


MTGAGE.l 

NASDAQ.1 

R_GDP_1 

UNEMPL.l 

Macro  Variables  (lagged  one  month) 

MTGAGE.12L 

Macro  Variables  (12  month  lagged  moving  average),  where 

NASDAQ.12L 

applicable. 

R  GDP  12L 

UNEMPL.12L 
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The  Basic  Analysis  -  The  Aviation  Skill  Group  Example 

In  this  section,  a  summary  of  model  estimates  for  the  Aviation  skill  group  is 
provided.  All  results  presented  here  are  in  terms  of  averages  for  the  Aviation  group.  The 
model,  however,  also  provides  the  possibility  of  individuals’  forecasts. 

Two  sets  of  results  are  summarized  here:  Pre  and  Post  September,  2001.  In  each 
case,  the  two  stages  of  the  analysis  are  presented:  Stay- Loss  (Step  One)  and  the  within 
year  promotion  probabilities.  Each  transition  in  the  promotion  stage  reflects  the 
progression  from  one  year  to  the  next.  The  forecasting  tables  represent  the  transition 
probability  within  1  to  7  years  from  each  grade  on. 

Table  3. 

Aviations  -  Pre  September  2001 
Step- One:  Stay  -  Loss 


Estimated  Transition  Probabilities 


03 

04 

05 

06 

LO 

03 

0.985 

0.000 

0.000 

0.000 

0.015 

04 

0.000 

0.977 

0.003 

0.000 

0.020 

05 

0.000 

0.000 

0.979 

0.001 

0.020 

06 

0.000 

0.000 

0.000 

0.941 

0.059 

Actual  Transition  Counts 

03 

04 

05 

06 

LO 

TOTAL 

03 

11452 

2 

0 

0 

163 

11617 

04 

0 

3962 

28 

0 

161 

4151 

05 

0 

0 

3546 

10 

47 

3603 

06 

0 

0 

0 

1268 

99 

1367 

Predicted  Transition  Counts 

03 

04 

05 

06 

LO 

03 

11439 

0 

0 

0 

178 

11617 

04 

0 

4056 

12 

0 

83 

4151 

05 

0 

0 

3527 

4 

73 

3603 

06 

0 

0 

0 

1287 

80 

1367 
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Table  4. 

Step-Two:  Promotion  Probabilities  (Within  a  Year) 


Prior  Probabilities  Used 

03 

04 

05 

06 

LO 

03 

0.832 

0.096 

0.000 

0.000 

0.072 

04 

0.000 

0.836 

0.111 

0.000 

0.053 

05 

0.000 

0.000 

0.864 

0.069 

0.067 

06 

0.000 

0.000 

0.000 

0.873 

0.127 

Estimated  Transition  Probabilities 

03 

04 

05 

06 

LO 

03 

0.672 

0.163 

0.000 

0.000 

0.165 

04 

0.000 

0.775 

0.225 

0.000 

0.000 

05 

0.000 

0.000 

0.551 

0.137 

0.313 

06 

0.000 

0.000 

0.000 

0.854 

0.146 

Actual  Transition  Counts 

03 

04 

05 

06 

LO 

TOTAL 

03 

8114 

2145 

0 

0 

2015 

12274 

04 

0 

2631 

960 

0 

269 

3860 

05 

0 

0 

1194 

420 

351 

1965 

06 

0 

0 

0 

750 

216 

966 

Predicted  Transition  Counts 

03 

04 

05 

06 

LO 

03 

8243 

2003 

0 

0 

2028 

12274 

04 

0 

2992 

868 

0 

0 

3860 

05 

0 

0 

1082 

269 

614 

1965 

06 

0 

0 

0 

825 

141 

966 
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Table  5. 

Aviation  -  Mean  Forecasting  Promotion 
Probabilities  for  T=2,  4,  7  years  forward 


T=2 

03 

PRE-9/  11 

04  05 

06 

LO 

03 

0.451 

0.236 

0.037 

0.000 

0.276 

04 

0.000 

0.601 

0.298 

0.031 

0.070 

05 

0.000 

0.000 

0.303 

0.192 

0.505 

06 

0.000 

0.000 

0.000 

0.729 

0.271 

LO 

0.000 

0.000 

0.000 

0.000 

1.000 

T=4 

03 

04 

05 

06 

LO 

03 

0.204 

0.248 

0.098 

0.014 

0.436 

04 

0.000 

0.361 

0.270 

0.098 

0.271 

05 

0.000 

0.000 

0.092 

0.198 

0.710 

06 

0.000 

0.000 

0.000 

0.532 

0.468 

LO 

0.000 

0.000 

0.000 

0.000 

1.000 

T=7 

03 

04 

05 

06 

LO 

03 

0.062 

0.168 

0.106 

0.047 

0.618 

04 

0.000 

0.168 

0.153 

0.141 

0.538 

05 

0.000 

0.000 

0.015 

0.142 

0.842 

06 

0.000 

0.000 

0.000 

0.331 

0.669 

LO 

0.000 

0.000 

0.000 

0.000 

1.000 

FROM  03 

TO  03 

TO  04 

TO  05 

TO  06 

LOSS 

YR1 

0.672 

0.163 

0.000 

0.000 

0.165 

YR2 

0.451 

0.236 

0.037 

0.000 

0.276 

YR3 

0.303 

0.257 

0.073 

0.005 

0.362 

YR4 

0.204 

0.248 

0.098 

0.014 

0.436 

YR5 

0.137 

0.226 

0.110 

0.026 

0.502 

YR6 

0.092 

0.197 

0.111 

0.037 

0.563 

YR7 

0.062 

0.168 

0.106 

0.047 

0.618 

FROM  04 

TO  03 

TO  04 

TO  05 

TO  06 

LOSS 

YR1 

0.000 

0.775 

0.225 

0.000 

0.000 

YR2 

0.000 

0.601 

0.298 

0.031 

0.070 

YR3 

0.000 

0.466 

0.299 

0.067 

0.168 

YR4 

0.000 

0.361 

0.270 

0.098 

0.271 

YR5 

0.000 

0.280 

0.230 

0.121 

0.370 

YR6 

0.000 

0.217 

0.189 

0.135 

0.459 

YR7 

0.000 

0.168 

0.153 

0.141 

0.538 

FROM  05 

TO  03 

TO  04 

TO  05 

TO  06 

LOSS 

YR1 

0.000 

0.000 

0.551 

0.137 

0.313 

YR2 

0.000 

0.000 

0.303 

0.192 

0.505 

YR3 

0.000 

0.000 

0.167 

0.206 

0.628 

YR4 

0.000 

0.000 

0.092 

0.198 

0.710 

YR5 

0.000 

0.000 

0.051 

0.182 

0.767 

YR6 

0.000 

0.000 

0.028 

0.162 

0.810 

YR7 

0.000 

0.000 

0.015 

0.142 

0.842 
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Table  6. 

Aviation  -  Post  September  2001 
Step- One:  Stay  Loss 


Estimated  Transition  Probabilities 

03  04  05  06  LO 

03 

0.985 

0.000 

0.000 

0.000 

0.015 

04 

0.000 

0.950 

0.003 

0.000 

0.047 

05 

0.000 

0.000 

0.983 

0.000 

0.017 

06 

0.000 

0.000 

0.000 

0.940 

0.060 

Actual  Transition  Counts 

03 

04 

05 

06 

LO 

TOTAL 

03 

7137 

8 

0 

0 

75 

7220 

04 

0 

5343 

3 

0 

317 

5663 

05 

0 

0 

3410 

12 

36 

3458 

06 

0 

0 

0 

1351 

86 

1437 

Predicted  Transition  Counts 

03 

04 

05 

06 

LO 

03 

7112 

0 

0 

0 

108 

7220 

04 

0 

5379 

19 

0 

265 

5663 

05 

0 

0 

3398 

0 

60 

3458 

06 

0 

0 

0 

1351 

86 

1437 
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Table  7. 

Step-Two:  Promotion  Probabilities  (Within  a  Year) 


Estimated  Transition  Probabilities 

03  04  05  06  LO 

03 

0.591 

0.276 

0.000 

0.000 

0.133 

04 

0.000 

0.727 

0.229 

0.000 

0.044 

05 

0.000 

0.000 

0.710 

0.140 

0.150 

06 

0.000 

0.000 

0.000 

0.772 

0.228 

Actual  Transition  Counts 

03 

04 

05 

06 

LO 

TOTAL 

03 

4224 

2220 

0 

0 

973 

7417 

04 

0 

3196 

1261 

0 

285 

4742 

05 

0 

0 

1893 

436 

349 

2678 

06 

0 

0 

0 

604 

235 

839 

Predicted  Transition  Counts 

03 

04 

05 

06 

LO 

03 

4385 

2044 

0 

0 

988 

7417 

04 

0 

3450 

1085 

0 

207 

4742 

05 

0 

0 

1902 

374 

403 

2678 

06 

0 

0 

0 

647 

192 

839 
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Table  8. 

Aviation  -  Mean  Forecasting  Promotion 
Probabilities  T=2,  4,  7  years  forward 


POST-9/  11 


T=2 

03 

04 

05 

06 

LO 

03 

0.350 

0.364 

0.063 

0.000 

0.224 

04 

0.000 

0.529 

0.329 

0.032 

0.110 

05 

0.000 

0.000 

0.504 

0.207 

0.289 

06 

0.000 

0.000 

0.000 

0.596 

0.404 

LO 

0.000 

0.000 

0.000 

0.000 

1.000 

T=4 

03 

04 

05 

06 

LO 

03 

0.122 

0.319 

0.173 

0.025 

0.360 

04 

0.000 

0.280 

0.340 

0.104 

0.276 

05 

0.000 

0.000 

0.254 

0.227 

0.518 

06 

0.000 

0.000 

0.000 

0.355 

0.645 

LO 

0.000 

0.000 

0.000 

0.000 

1.000 

T=7 

03 

04 

05 

06 

LO 

03 

0.025 

0.167 

0.191 

0.075 

0.542 

04 

0.000 

0.108 

0.221 

0.146 

0.525 

05 

0.000 

0.000 

0.091 

0.163 

0.746 

06 

0.000 

0.000 

0.000 

0.163 

0.837 

LO 

0.000 

0.000 

0.000 

0.000 

1.000 

FROM  03 

TO  03 

TO  04 

TO  05 

TO  06 

LOSS 

YR1 

0.591 

0.276 

0.000 

0.000 

0.133 

YR2 

0.350 

0.364 

0.063 

0.000 

0.224 

YR3 

0.207 

0.361 

0.128 

0.009 

0.296 

YR4 

0.122 

0.319 

0.173 

0.025 

0.360 

YR5 

0.072 

0.266 

0.196 

0.043 

0.422 

YR6 

0.043 

0.213 

0.200 

0.061 

0.483 

YR7 

0.025 

0.167 

0.191 

0.075 

0.542 

FROM  04 

TO  03 

TO  04 

TO  05 

TO  06 

LOSS 

YR1 

0.000 

0.728 

0.229 

0.000 

0.044 

YR2 

0.000 

0.529 

0.329 

0.032 

0.110 

YR3 

0.000 

0.385 

0.355 

0.071 

0.190 

YR4 

0.000 

0.280 

0.340 

0.104 

0.276 

YR5 

0.000 

0.204 

0.306 

0.128 

0.363 

YR6 

0.000 

0.148 

0.264 

0.141 

0.447 

YR7 

0.000 

0.108 

0.221 

0.146 

0.525 

FROM  05 

TO  03 

TO  04 

TO  05 

TO  06 

LOSS 

YR1 

0.000 

0.000 

0.710 

0.140 

0.150 

YR2 

0.000 

0.000 

0.504 

0.207 

0.289 

YR3 

0.000 

0.000 

0.358 

0.230 

0.412 

YR4 

0.000 

0.000 

0.254 

0.227 

0.518 

YR5 

0.000 

0.000 

0.181 

0.211 

0.609 

YR6 

0.000 

0.000 

0.128 

0.188 

0.684 

YR7 

0.000 

0.000 

0.091 

0.163 

0.746 

The  above  results  demonstrate  the  capabilities  of  the  model  used.  Finally,  for 
comparison,  the  estimated  transitions  are  converted  into  mean  career  paths  for  each 
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skill  group.  The  Aviation  and  SWO  are  presented  below.  It  is  interesting  to  see  the 
differences  in  career  paths  between  the  Pre  and  Post  2001  (“peace”  vs.  “war”  periods) 
and  among  the  two  skill  groups  presented  here. 


T 


n - 1 - 1 - 1 - 1 - 1 - 

0  5  10  15  20  25  30 

Years  Since  Commissioned 


Figure  1.  Career  Paths,  Skill  Group:  AVI 


0  5  10  15  20  25  30  35 

Years  Since  Commissioned 


Figure  2.  Career  Paths,  Skill  Group:  SWO 
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Concluding  Remarks 


The  promotion  probability  Officers  project  is  an  experimental  project  where  the 
objective  was  to  model  the  promotion  structure  of  Navy  officers,  by  skill  groups, 
conditional  on  their  personal  characteristics  and  past  performance  as  well  as  on  other 
economic  and  policy  information.  The  model  developed  is  a  two  stage  Markov  model.  In 
the  first  stage  the  Stay- Loss  probabilities  are  analyzed.  Then,  in  the  second  stage  (and 
conditional  on  the  first  stage)  the  promotion  probability  for  each  officer  is  estimated. 

The  analysis  is  broken  into  two  mutually  exclusive  periods:  Before  September  2001  and 
after  that  date.  This  allows  us  to  capture  the  changes  in  behavior  and  policies  as  a 
transition  for  peaceful  period  to  a  war  era.  For  comparison  reasons,  the  analysis  of  the 
full  period  of  the  data  (with  a  dummy  variable  for  September,  2001)  is  provided. 

The  econometric  model  develop  is  an  information  theoretic,  generalized  maximum 
entropy  model.  This  model  is  nonparametric  in  the  distribution,  allows  us  to  incorporate 
prior  information  and  is  flexible  enough  to  analyze  small  data. 

In  the  project  we  were  able  to  estimate  the  individual’s  (officer)  promotion 
trajectory,  and  to  investigate  the  changes  to  that  trajectory  subject  to  different 
constraints.  These  constraints  can  be  individual-level  restrictions,  such  as  geographic 
requests,  or  global  requirements,  such  as  economic  or  policy  changes.  The  model 
developed  performs  well  with  the  data  analyzed. 

Additional  analysis  could  include  (i)  improvement  of  the  forecasts  by  allowing  for 
additional  information  (variables)  in  the  data  sets  (especially  education  and 
performance  related  information);  (ii)  expansion  to  additional  officers’  skill  groups,  and 
(iii)  perform  more  simulations  on  different  policy,  economic  and  performance  scenarios. 
In  addition,  an  analysis  of  gender,  race  and  other  sub  groups  of  interest  should  be 
performed. 


18 


References 


A.  Golan,  G.  G.  Judge  and  D.  Miller  (1996),  Maximum  Entropy  Econometrics:  Robust 
Estimation  with  Limited  Data,  John  Wiley  &  Sons. 

A.  Golan,  G.  G.  Judge  and  J.  Perloff,  “A  Generalized  Maximum  Entropy  Approach  to 
Recovering  Information  from  Multinomial  Response  Data,”  Journal  of  the 
American  Statistical  Association,  Vol.  91  (1996),  pp.  841-853. 

Journal  of  Econometrics.  Special  Two-Issues  on  “Information  and  Entropy 
Econometrics,”  Ed.  A.  Golan,  (Volume  107,  Issues  1-2,  March,  2002). 

Journal  of  Econometrics.  Special  Issue  on  “Information  and  Entropy  Econometrics,” 
based  on  selected  papers  presented  at  the  IEE  Conference,  Eds.  A.  Golan  and  Y. 
Kitamura,  (Volume  138,  2007). 

A.  Golan,  CCMT  -  unpublished  Technical  Report  I  (January,  2004) 

A.  Golan,  CCMT  -  unpublished  Technical  Report  II  (January,  2005) 


19 


Appendix  A: 
Variables  and  Models 


A-o 


In  this  Appendix,  the  final  set  of  models,  and  variables,  for  each  skill  group  is 
summarized. 

The  Variables: 

1.  Gender  (Male=i) 

2.  Race  (Empirical  dummies,  white  as  reference  category). 

3.  Fleet  Concentration  (Empirically  defined  dummy  variables) 

4.  Sea/ Shore  and  Other  category 

5.  Education  (all  MA  and  higher  education  =  1;  the  rest  are  o) 

6.  APC 

a.  APC_MISS  (if  APC  is  missing) 

b.  APC_INDEX  =  1  -  (APC_Q/5  +  APC_M/6  +  APC_T/s)/3  (defined  only 
where  APC_MISS  =  o) 

7.  Joint  Specialty  Code  (empirically  defined  dummy  variables) 

8.  POST911  (1  after  9/11  and  o  before) 

9.  Additional  Qualification  Designation  (AQD) 

a.  Primary  AQD  (empirical  dummies)  identifies  the  most  relevant  AQD  at 
current  observation  level. 

b.  Number  of  AQDs  (cumulative  count  of  the  number  of  AQDs  accumulated 
to  date. 

10.  Subspecialty  code  (at  the  function  level).  Empirical  dummies. 

11.  Screening  (empirically  defined  dummy  variables  for  category  1,  2,  3  and  4. 
Reference  category  is  when  Screening  is  missing  (not  screened  for  anything). 
Due  to  special  issues  with  AVI  data,  category  2  is  the  reference  for  AVI. 

12.  Time  in  Rank  and  Time  in  Rank  Squared 

13.  Years  Since  Commissioned  and  Years  Since  Commissioned  Squared 

14.  Individual  evaluations: 

a.  PROFEXP 

b.  EQUALOPP 

c.  MILBEAR 

d.  TEAMWORK 

e.  LEADER 

f.  TACTPERF 

15.  Rater  Average  Evaluations  (RS_SUMAVG) 

16.  Macro  Variables  (Lagged  1  period) 

a.  Mortgage  Rate 

b.  NASDAQ 

c.  Real  GDP 

d.  Unemployment  Rate 

17.  Number  of  Aviation  Hours  accumulated  (specific  to  AVI  data  set  only). 

18.  Recall,  this  is  a  two-step  (or  two  basic  “periods”)  model.  The  first  step  model 
is  for  a  period  when  individuals  are  not  supposed  to  be  promoted.  During  that 
period  individuals  can  either  stay  or  leave  (loss)  the  Navy.  A  very  small 
number  of  individuals  do  get  promoted  during  that  period.  The  next  period  is 
the  phase  where  individuals  are  allowed  to  be  promoted.  They  can  stay  at  the 
same  grade,  be  promoted,  or  leave  every  year.  The  distinction  between  the 
first  and  second  stage  is  made  with  a  dummy  variable  defined  as  follows: 


A-i 


i.  @03,  INRULE  =  iifYCS>=9, 

ii.  @04,  INRULE  =  1  if  YCS>=15, 

iii.  @05,  INRULE  =  1  if  YCS>=21, 

iv.  @06,  INRULE  =  1  if  YCS>=27 

On  an  average  this  corresponds  to  about  6  years  in  rank  before  promotion. 

This  takes  into  account  the  DoD  policy  plus/minus  one  year  (for  flexibility). 

The  final  model  used  is  described  in  the  text  and  provided  with  the  CD.  Here  we 
discuss  the  major  models  and  experiments  done  during  the  empirical  investigations 
toward  the  final  model. 

The  Basic  Models  Tested  and  Discarded: 

1.  Models  with  different  APC  measures  (continuous  variables  and  dummy 
variables).  In  addition  to  the  APC_INDEX  variables  described  above,  we  also 
estimated  models  with 

b.  APC_Q,  APC_M,  and  APC_T  variables  included  as  independent  variables. 
These  APC  measures  (Quality,  Math,  and  Technical)  were  treated  as 
continuous  measures  in  this  version. 

c.  High,  Medium,  and  Low  dummy  variable  versions  of  the  APC_Q,  APC_M, 
and  the  APC_T  variables  were  also  tried  separately.  In  both  cases,  these 
measures  were  deemed  to  be  insignificant  in  both  sets  of  models. 

d.  Although  this  does  not  constitute  a  model,  we  mined  the  data  to  see 
whether  the  promotion  ratios  varied  significantly  across  the  APC 
categories  within  Q,  M,  and  T.  We  found  that  they  did  not.  This  led  us  to 
ultimately  drop  the  dummy  variable  version  of  the  model  (described  in  the 
previous  paragraph). 

2.  Models  with  the  average  evaluation  rating  (TRAIT AVG)  included  as  a  predictor. 
Invariably,  this  measure  was  highly  correlated  with  the  constituent  components 
so  it  was  eventually  dropped. 

3.  Models  that  included  past  promotion  recommendations  (for  previous  ranks)  to 
capture  potential  unobserved  propensity.  This  however,  is  inconsistent  with  Navy 
policy  and  behavior  (i.e.,  according  to  Navy  researchers,  there  is  no  reason  to 
believe  that  past  recommendations  have  any  bearing  on  current  promotion 
prospects).  Therefore,  we  did  not  use  this  set  of  variables. 

4.  Models  with  finer  tunings  of  the  education  dummy  variables.  However,  since  the 
distribution  in  both  data  sets  (Stay- Loss  stage  and  Promotion  stage)  was 
practically  bi-model  (clustering  on  BA  and  MA),  the  only  two  meaningful 
categories  we  could  analyze  are  dummy  variables  corresponding  to  BA  and  MA. 

5.  Finally,  because  APC  is  missing  for  many  individuals  in  the  data  (e.g.,  roughly 
33%  of  cases  and  36%  of  individuals  in  SWO)  we  tried  estimating  these  models 
based  only  on  the  subset  of  the  data  where  APC  was  observed.  The  qualitative 
findings  of  the  models  did  not  change. 
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